Applying Grid Technologies to XML Based OLAP Cube Construction

Authors

  • Tapio Niemi
  • Marko Niinimäki
  • Jyrki Nummenmaa
  • Peter Thanisch
Abstract

On-Line Analytical Processing (OLAP) is a powerful method for analysing large volumes of warehouse data. Typically, the data for an OLAP database is collected from a set of data repositories such as operational databases. This data set is often huge, and it may not be known in advance which data are required, or when, to perform the desired analysis tasks. Some parts of the data may be needed only occasionally. Storing all data in the OLAP database and keeping that database constantly up to date is therefore not only highly demanding but may also be overkill in practice. This suggests that in some applications it would be more feasible to form the OLAP cubes only when they are actually needed. However, OLAP cube construction can be a slow process. Here, we present a system that applies Grid technologies to distribute the computation needed in the cube construction process. As the data sources may well be heterogeneous, we propose an XML language as an interim format for collecting the data. The user's definition of a new OLAP cube often includes selecting and aggregating the data. In our system this computation is distributed to the computers that store the original data. This reduces network traffic and speeds up the computation, which is now performed in parallel. We have implemented a prototype of the system. The implementation uses software packages called Spitfire (a database front end) and Mobile Analyzer (a Java distributed computing platform). Both of these have their background in Grid technologies.
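The idea of pushing selection and aggregation to the machines that hold the data, and shipping only small XML summaries back, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `cube`/`cell` element names are not the XML language proposed by the authors, the two in-memory lists stand in for remote data sources, and a thread pool stands in for Grid job distribution (it does not use Spitfire or Mobile Analyzer).

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
import xml.etree.ElementTree as ET

# Hypothetical local data held by two source nodes: (product, year, sales).
NODE_A = [("tea", 2002, 10.0), ("tea", 2003, 5.0), ("coffee", 2003, 7.0)]
NODE_B = [("tea", 2003, 2.5), ("coffee", 2003, 1.5), ("coffee", 2001, 9.0)]


def local_aggregate(rows, min_year):
    """Runs where the data lives: select rows, sum sales per cell, emit XML."""
    sums = defaultdict(float)
    for product, year, sales in rows:
        if year >= min_year:  # selection step
            sums[(product, year)] += sales  # aggregation step
    cube = ET.Element("cube")  # illustrative element names, not the paper's schema
    for (product, year), total in sorted(sums.items()):
        cell = ET.SubElement(cube, "cell", product=product, year=str(year))
        cell.text = repr(total)
    return ET.tostring(cube, encoding="unicode")


def merge_partials(xml_docs):
    """Runs at the coordinator: add up the partial sums from every node."""
    merged = defaultdict(float)
    for doc in xml_docs:
        for cell in ET.fromstring(doc).iter("cell"):
            key = (cell.get("product"), int(cell.get("year")))
            merged[key] += float(cell.text)
    return dict(merged)


def build_cube(nodes, min_year):
    """Aggregate on each node in parallel, then merge the XML summaries."""
    with ThreadPoolExecutor() as pool:  # stand-in for distributing Grid jobs
        partials = list(pool.map(lambda rows: local_aggregate(rows, min_year), nodes))
    return merge_partials(partials)


if __name__ == "__main__":
    print(build_cube([NODE_A, NODE_B], min_year=2002))
```

Merging partial results this way is only valid for distributive aggregates such as SUM or COUNT; an average, for example, would need each node to send both a sum and a count.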

Similar resources

XML encoding and Web Services for Spatial OLAP data cube exchange: an SOA approach

XML and Web Services technologies have revolutionized the way data are exchanged on the Internet. Meanwhile, Spatial OLAP (SOLAP) tools have emerged to bridge the gap between the Business Intelligence and Geographic Information Systems domains. While Web Services specifications such as XML for Analysis enable the use of OLAP tools in Service Oriented Architecture (SOA) environments, no solution...


GMLA: A XML Schema for Integration and Exchange of Multidimensional-Geographical Data

The integration among DW, OLAP and GIS has been given considerable attention in recent years by many researchers and industrial corporations. This may be a result of: 1) DW/OLAP can improve GIS spatial queries whereas, 2) a GIS can provide better support to deal with the DW/OLAP geographic data. Some research about this integration has already been done. However, these approaches do not deal wi...


A tool for data cube construction from structurally heterogeneous XML documents

Data cubes for OLAP (Online Analytical Processing) often need to be constructed from data located in several distributed and autonomous information sources. Such a data integration process is challenging due to semantic, syntactic, and structural heterogeneity among the data. While XML (Extensible Markup Language) is the de facto standard for data exchange, the three types of heterogeneity rema...


An interoperable XML encoding for the exchange of Spatial OLAP data cubes in SOA environments

XML and Web Services technologies have revolutionized the way data are exchanged on the Internet. Meanwhile, Spatial OLAP (SOLAP) tools have emerged to bridge the gap between the Business Intelligence and Geographic Information Systems domains. While Web Services specifications such as XML for Analysis enable the use of OLAP tools in Service Oriented Architecture (SOA) environments, no solution...


XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses

Recently, a large number of XML documents have become available on the Internet. This trend has motivated many researchers to analyze them multidimensionally, in the same way as relational data. In this paper, we propose a new framework for multidimensional analysis of XML documents, which we call XML-OLAP. We base XML-OLAP on XML warehouses where every fact data as well as dimension data are stored as XML...



Publication date: 2003